

METHOD AND APPARATUS FOR IMPROVED BOOKMARK AND 
HISTORIES ENTRY CREATION AND ACCESS / ^ 



BACKGROUND OF THE INVENTION 



1. 



Technical Field: 



[0001] The present invention relates generally to organizing and accessing entries in a web 
page list and in particular to organizing a web page list to easily locate and retrieve relevant 
pages based on content. Still more particularly, the present invention relates to employing 
keywords and search terms in connection with a web page list to facilitate location and retrieval 
of relevant pages based on content. 

2. Description of the Related Art: 

[0002] Almost all hypertext systems, including web browsers, provide a facility listing 
particular sites on the Intemet for quick, easy access. Such a faciUty is normally referred to as a 
bookmark or "favorites" list, a history list, a hotlist, or "channels." These lists are typically 
displayed within a browser or other communications management utility (such as Windows™ - 
Explorer) as a textual list of titles which may be selected in "point-and-click" fashion by the 
user. Often a user may add new entries to the list, alter the title or Uniform Resource Locator 
(URL) associated with a list entry, reorder entries, or delete one or more entries from the list. 
Some lists simply identify, on a rolling basis, a specified number of the last sites which a user 
visited using the browser. 
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[0003] Several organizational problems attend these lists as they are currently implemented 
and employed. First, by nature these lists offer minimal insight into the content represented by an 
entry. The problem is analogous to locating a desired book or volume within a library. To locate 
a particular volume in a library with only ten books is usually easy. Even if the cover or table of 
contents of each of the ten volumes must be inspected to determine the content, this takes only a 
few minutes, at most. Locating the same volume within a library of 300,000 books, however, can 
be much more difficult and time consuming. 

[0004] Furthermore, lists of this type continually expand as the user's web browsing 
continues, quickly growing past a generally manageable size. Users often accumulate hundreds 
of bookmarks in their bookmark lists, while history lists can expand to thousands of entries 
depending on browser settings. Although the linear nature of such lists demands organization to 
efficiently locate relevant entries, no automated mechanism for organizing these lists exists. 
Users typically do not spend the time required to organize their bookmark lists, and history list 
organization is constrained by whichever sorting mechanisms are supported by the browser (e.g., 
by URL, name, first visited, last visited, etc.). Even if organized, however, the organization may 
not provide sufficient information to enable selection of the appropriate site based upon content. 

[0005] The problem is further complicated when a group of users shares a common set of 
bookmarks, such as when one user sends a bookmark file to another or when multiple users 
employ the same machine with a common or standard bookmark and history configuration. Li 
either case, the user wishing to locate particular content from the list(s) may find the proper 
bookmark or history entry only by manually searching recursively through folders of shared 
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bookmarks, and in some cases actually retrieving the associated web pages to ascertain the 
content. 

[0006] The problem, as illustrated by the library analogy above, revolves around scale, but is 
also compounded by the fact that categorized bookmark or history entries may fit into more than 
one designated category. While "folders" within such lists are designed to provide a useful 
classification system for the entries, the categorization for a bookmark or history entry which 
best identifies the linked content may change over time and may not be intuitive across all users. 

[0007] It would be desirable, therefore, to provide a mechanism allowing the user to 
efficiently and automatically locate hotlist entries related to a topic of interest, without requiring 
substantial organizational overhead. It would fiirther be advantageous for the mechanism to 
automatically index a site to facihtate location of desired content when each bookmark or history 
entry is created. 
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SUMMARY OF THE INVENTION 

[0008] It is therefore one object of the present invention to improve organization of and 
access to entries in a web page list. 

[0009] It is another object of the present invention to organize a web page Ust for easy 
location and retrieval of relevant pages based on content. 

[0010] It is yet another object of the present invention to employ keywords and search terms 
in connection with a web page list to facilitate location and retrieval of relevant pages based on 
content. 

[0011] The foregoing objects are achieved as is now described. Each entry within a "hotlist" 
(bookmark, favorites, or history list) includes a set of keywords topically identifying the content 
of the linked page. The keywords included within META tags for HTML files of an accessed 
Web page may be automatically detected and extracted by the browser for the hotlist when the 
browser creates the entry for that page within the hotlist. Alternatively, the browser may tabulate 
terms from the page content itself to select keywords identifying the content. In either case, the 
user may selectively edit the keywords for a hotlist entry to add, modify, or delete keywords. The 
keywords are stored with the hotlist data and may be searched for a particular topic. Hotlist 
entries matching the search criteria are displayed to the user and, upon selection of a particular 
matching entry by the user, employed to automatically generate a request for the corresponding 
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page. 



[0012] The above as well as additional objectives, features, and advantages of the present 
invention will become apparent in the following detailed written description. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

[0013] The novel features believed characteristic of the invention are set forth in the 
appended claims. The invention itself however, as well as a preferred mode of use, further 
objects and advantages thereof, will best be understood by reference to the following detailed 
description of an illustrative embodiment when read in conjunction with the accompanying 
drawings, wherein: 

[0014] Figure 1 depicts a data processing system network in which a preferred embodiment 
of the present invention is implemented; 

[0015] Figures 2A-2B are data structures for bookmarks and history lists in accordance with 
a preferred embodiment of the present invention; 

[0016] Figure 3 depicts a mock-up of a user-interface for viewing and/or editing keywords 
associated with a hotlist entry in accordance with a preferred embodiment of the present 
invention; 

[0017] Figures 4A-4C are high-level flowcharts for processes of creating and maintaining 
hotlist entry keyword fields in accordance with a preferred embodiment of the present invention; 

[0018] Figure 5 depicts a mock-up of a user interface dialog enabling search of hotlist 
entries utilizing content-identifying keywords in accordance with a preferred embodiment of the 
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present invention; and 

[0019] Figure 6 is a high-level flowchart for a process of searching, retrieving, and 
displaying hotlist entries based on keyword data within the hotlist entries in accordance with the 
preferred embodiment of the present invention. 



AUS920000120US1 



-7- 



DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 

[0020] With reference now to the figures, and in particular with reference to Figure 1, a data 
processing system network in which a preferred embodiment of the present invention is 
implemented is depicted. The data processing system network 102 includes a number of servers 
104a-104n (where n is any positive integer) storing files including World Wide Web content. 
Servers 104a-104ii are capable of selectively providing pages, typically HyperText Markup 
Language (HTML) documents and accompanying graphics, sound, or video files, in response to 
HyperText Transfer Protocol (HTTP) requests fi-om clients 1 06a-l 06n coupled to servers 104a- 
104ii via Internet 108. 

[0021] As known in the art, content within servers 104a-104n is retrieved utiUzing Uniform 
Resource Locators (URLs) providing an address mapping to the content. Each URL typically 
includes a resource identifier (e.g., "vmw"), a second level domain name (e.g., "ibm"), and a top 
level domain name (e.g., ".com"), and may optionally include a directory path and/or filename 
for the content being requested. 

[0022] As known in the art and illustrated for client 106n, each individual client 106a-106n 
typically includes an operating system 110 facilitating the interaction between the system 
hardware and various software programs and one or more user applications 112, which may 
include all manner of network and non-network dependent applications ranging firom 
spreadsheets to Internet telephony. In the present invention, one of the programs executing 
within the client 1 06n is a web browser 114 or similar data access and retrieval application. 
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[0023] The client 106n and browser 114 in the exemplary embodiment are intended for 
shared use by multiple users. Among other constituent parts, the web browser 114 will typically 
have a file and/or directory 116 for user data and preferences for a particular user, as well as a 
data structure 118 containing the user's bookmarks and a second data structure 120 containing 
the user's browsing history. In the present invention, both the bookmarks list 118 and the history 
list 120 contain keyword data 122 and 124, respectively, associated with URLs within the lists as 
described in further detail below. The term "hotlist" is employed herein to refer to any 
bookmarks, histories, or favorites lists, channels, or any other data structure containing a list of 
URLs pointing to selected content. 

[0024] Referring now to Figures 2A and 2B, data structures for bookmarks and history lists 
in accordance with a preferred embodiment of the present invention are illustrated. Figure 2A 
illustrates a bookmarks list 118, a listing of URLs which a user has selected for ready reference. 
Each entry within bookmarks list 118 includes a name 202 identifying generally the site which is 
linked (e.g., "IBM Home Page"), an URL 204 providing the link to the referenced content, an 
add date 206 identifying the date on which the corresponding bookmark was added to bookmarks 
list 118, and a last visited date 208 identifying the date on which the referenced content was last 
accessed by the user. Each entry also include a description 210 and keywords 122. 

[0025] Description 210 is typically an intelligible statement regarding the content referenced 
or the nature of the enterprise which published the reference content. Description 210 is usually 
captured firom the referenced content when a bookmark to the referenced content is created. For 
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example, the IBM Home Page includes a "description" meta tag containing an intelligible 
statement regarding the nature of the page and the enterprise publishing the page: 

<meta name="DESCRIPTION" content="The IBM corporate home page, entry point to 
information about IBM products and services"/> 

This description may be captured when the link is added to bookmarks list 118, and stored 
within description field 210 of the corresponding entry. Alternatively, the content of description 
field 210 may be manually entered by the user. Description 210 may be employed, for example, 
to provide a "flyover" pop-up description for the bookmark entry displayed whenever a pointer- 
driven cursor overlies a display region in which the bookmark entry is displayed. 

[0026] Keywords 122 is a set of terms provided to identify the content of the corresponding 
bookmarks entry, generally in an arbitrary and unintelligible list of terms. As with description 
210, keywords 122 may be captured from the meta tags of content referenced by a bookmark 
entry at the time the bookmark entry is created: 

<meta name- 'KEYWORDS" value- 'IBM Microelectronics analog mixed signal asics 
ceramic probe cards communications networking ics digital video mpeg electronic card assembly 
test foundry services palette dacs ieee 1394 phy transceivers infrared technology memory mwave 
dsp packaging interconnect embedded controllers powerpc serial storage architecture x86 
microprocessors" content="> 

Such keywords are employed by Internet search engines to index sites, but are not 
captured within a field for a hotlist entry as proposed by the present invention. 



AUS920000120US1 



-10- 



[0027] Alternatively, keywords 122 may be subject to manual entry or editing by the user. 
Unscrupulous enterprises often misrepresent content in order to attract more "hits." For example, 
a site may include a reference to "sex" or the name of a competing enterprise within the keyword 
meta tag of the content published, in an effort to draw hits from keyword searches unrelated to or 
not intended to identify the corresponding site. For this reason, user editing capability over 
keywords 122 is preferred. 

[0028] Figure 2B illustrates a history list 120, a rolling list of the last n sites (where n is any 
positive integer) visited by the user or of sites visited by the user within the defined period (e.g., 
the last 30 days), which is automatically compiled by the browser. History list 120 also includes 
a name 212 and a corresponding URL 214 for each entry. lii addition, each entry may include a 
first visited date 216 identifying the first time the user visited the corresponding site, a last 
visited date 218 identifying the most recent visitation by the user, and an expiration date 220 
identifying a date on which the corresponding entry will be deleted, where aging is employed as 
part of the browsing history algorithm. Typically, history list 120 may be sorted for display 
utilizing any of the previously described fields within each entry. 

[0029] History list 120 in the present invention includes keywords 124, an arbitrary (with 
respect to the rules of grammar) listing of terms defining the content referenced by the 
corresponding entry. As with keywords 122, keywords 124 may be automatically captured from 
the meta tags of the link content when the corresponding entry within history list 120 is created. 
Keywords 124 (and keywords 122) may be automatically updated whenever the linked site is 
revisited to reflect changes in the content published at the referenced URL. 
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[0030] In addition to capture from meta tags or manual user entry/editing, keywords 122 or 
124 within a hotlist may be generated from the referenced content itself. The HTML text for the 
content may be processed to generate a list of terms which appears within the content, which 
may be filtered based on frequency of occurrence within the content, uniqueness, or any other 
criteria. The resulting terms may be employed in lieu of, or in addition to, any terms captured 
from the meta tags of the content or entered by the user within keywords 122 or 124. 

[0031] Although only a bookmarks Hst and a history hst are depicted in Figures 2 A and 2B, 
keywords may be employed with any type of hotlist, and the techniques for selecting keywords 
may be employed regardless of the type of hotlist in which an entry is created. 

[0032] With reference now to Figure 3, a mock-up of a user-interface for viewing and/or 
editing keywords associated with a hotlist entry in accordance with a preferred embodiment of 
the present invention is depicted. The user interface dialog 300 is generated in response to a user 
selecting a hotlist entry and actuating an edit or "view properties" fiinction. The hotlist entry 
being viewed and/or edited in the example shown is a bookmarks list entry. Accordingly, the 
name 202, URL 204, description 210 and keywords 122 are all displayed within the user dialog 
300. All of the display fields for name 202, URL 204, description 210 and keywords 122 support 
editing capabiHty, allowing the user to edit the contents of those display fields. 

[0033] Referring now to Figures 4A through 4C, high level flow charts for processes of 
creating and maintaining hotlist entry keyword fields in accordance with a preferred embodiment 
of the present invention are illustrated. Figure 4A illustrates the process of generating a keyword 
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field for a newly created hotlist entry. The process begins at step 402, which depicts creation of a 
hotlist entry for content referenced by an URL. The process first passes to step 404, which 
illustrates, after retrieving the linked content, if necessary, a determination of whether the linked 
content contains a keyword meta tag. 

[0034] While existing keyword meta tags may be employed, a novel HTML meta tag such 

as: 

<meta name="HOTLISTKEYWORDS" value=". -content="> 
may alternatively be employed. Such a meta tag would contain keywords specifically intended 
by the content publisher to be automatically extracted by a browser or similar HTML facility and 
included within a hotlist data structure to facilitate location of the desired link by the user at a 
later time. 

[0035] If the linked content contains keyword meta tags, the process proceeds to step 406, 
which depicts extracting the keyword terms from the content meta tags. From step 406, or from 
step 404 if the linked content does not contain keyword meta tags, the process may optionally 
proceed to step 408, which illustrates scanning the linked content for unique or frequently 
recurring terms. A dictionary of "common" terms may be employed for this purpose, along with 
recurrence threshold(s). The content scaimed may include meta data or only displayed contented, 
but should preferably include all content data so that images, sound files and motion picture files 
may also be indexed within the keywords by their name or description. 
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[0036] If step 408 is implemented, the process may then optionally pass to step 410, which 
depicts filtering and/or augmenting the meta tag keywords from the keyword meta tags (if any) 
utilizing the content keywords identified by scanning the content. In this manner, intentionally 
misleading or misdescriptive keywords included within the keyword meta tags by the publisher 
may be eliminated if the terms do not also appear within content itself Alternatively, keywords 
for linked content may be synthesized from the content where no keyword meta tags are included 
within the published content, and meta tag keyword terms may be supplemented by content- 
derived terms. 

[0037] The process then passes to step 412, which illustrates entering the selected keyword 
terms (meta tag keyword terms, content-derived keyword terms, or come combination thereof), if 
any, into the keyword field of the hotlist entry just created. The process then passes to step 414, 
which illustrates the process becoming idle until another hotlist entry is created. 

[0038] Figure 4B illustrates a process of automatically updating the keyword field of a 
hotlist entry. The browser or other HTML facility for which the hotlist is employed may allow 
the user to selectively designate the hotlist, or particular entries within the hotlist, for automatic 
updating whenever the content linked by a corresponding entry within the hotlist is revisited. The 
process begins at step 416, which illustrates the user revisiting content linked by an entry within 
the hotlist. The user may access the content through the hotlist entry, in which case triggering of 
the update process (e.g., as part of retrieval of the content) is straightforward. Alternatively, the 
user may access the content by manual entry of the URL or from a link within other content. In 
this latter instance, the URL of the content accessed must be compared to each of the URLs 
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within the hotlist entries in order to determine whether the update process should be triggered. 

[0039] The process first passes to step 418, which illustrates a determination of whether the 
linked content has been modified since the last access of the content by the user. This may be 
determined from comparison of a "last visited" field within the hotlist entry with an update meta 
tag within the linked content, such as: 

<meta name="LASTUIPDATE" value='Tue Apr25 18:43:34 2000"!> 
The need for a keyword update for the linked content may be presumed if the content has 
been updated since it was last accessed by the user. Altematively, each entry within the hotlist 
may include a field in which the last update for the linked content, as last accessed by the user, is 
maintained. 

[0040] If the content has been updated since the user's last visit, or the need for a keyword 
update for the hotlist entry is otherwise determined to exist, the process proceeds to step 420, 
which depicts updating the keywords for the hotlist entry. Any or all of steps 404 through 412 
from Figure 4A may be employed for this purpose. It should be noted that the keyword field 
within a hotlist entry may optionally be segregated into "fixed" keywords for the content, 
selected by the user and not updated with each visit, and "dynamic" keywords which are updated 
(and deleted or overwritten, if necessary) whenever the linked content is found to have changed. 
From step 420, or from step 418 if the Hnked content is unchanged since last accessed by the 
user, the process proceeds to step 422, which illustrates the process becoming idle until another 
site linked by a hothst entry is accessed by the user. 
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[0041] Figure 4C illustrates the process of updating keyword data based on manual editing 
by the user. The process begins at step 424, which depicts the properties of a hotlist entry being 
edited by the user. The process first passes to step 426, which illustrates displaying the current 
keyword terms from the selected hotlist entry to the user, and then to step 428, which depicts 
receiving user specified keyword terms (e.g., an edited list from the display) for the entry. The 
process then passes to step 430, which illustrates entering the received keyword terms within the 
keyword field of the hotlist entry being edited, and then to step 432, which illustrates the process 
becoming idle until another hotlist entry is edited by the user. 



[0042] With reference now to Figure 5, a mock-up of a user interface dialog enabling search 
of hotlist entries utilizing content-identifying keywords in accordance with a preferred 
embodiment of the present invention is depicted. Dialog 500 allows the user to search the 
keyword substructures of a hotlist for selected search terms. Dialog 500 allows the user to enter 
one or more search term(s) within the primary keyword query term blanks 504, and to select 
logical operators using a logical operator blank 506 for logical linking of terms submitted in the 
keyword query. Any number of keyword term blanks 504 and logical operato r blanks 50 6 may 
be provided to the user, and the number of provided keyword term blanks 504 and logical 
operator blanks 506 may be expanded when those provided are all utilized. 



[0043] Upon actuation of the "Search" user control, a search engine (not shown) then 
searches the keyword data stored in the keyword section of the hotlist data structure. The user 
will typically be able to include, in addition to the search term or search terms, a series of logical 
operators that specify how the terms can be combined. For instance, the user will typically be 
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able to combine search terms with a 'logical and' operator when more than one term needs to be 
present to indicate a valid search resuh. The user will typically also be able to combine search 
terms with a 'logical or' operator when one term out of several possible search terms needs to be 
present to indicate a valid search result. The user will typically also be able to combine search 
terms with a 'logical except' operator when the presence of some search terms must accompany 
the absence of other terms in order to indicate a valid search result. Other logical search terms 
may be available and will fall within the scope of the present invention. 

[0044] Referring to Figure 6, a high-level flowchart for a process of searching, retrieving, 
and displaying hotlist entries based on keyword data within the hotlist entries in accordance with 
the preferred embodiment of the present invention is illustrated. The process begins at step 602, 
which depicts receipt of a hotUst entry search request. The process first passes to step 604, which 
illustrates displaying the search dialog to the user and receiving the search term(s) and logical 
operators (if any) from the user through the dialog. 

[0045] The process passes next to step 606, which depicts identifying matching hotlist 
entries by comparing the logical search criteria to keyword terms associated with each hotlist 
entry and determining which hotlist entries, if any, satisfy the search criteria. The hotlist entries 
which are identified as satisfying the search criteria are then sorted and displayed. The process 
next passes to step 608, which illustrates a determination of whether a displayed hotlist entry 
(which satisfies the specified search criteria) has been selected by the user. If so, the process 
proceeds to step 610, which depicts closing (or minimizing) the display and accessing the 
requested content utilizing the URL within the hotlist entry selected by the user. 
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[0046] If the user does not select a displayed hotlist entry, the process proceeds instead to 
step 612, which illustrates a determination of whether a request to close the display has been 
received (for example, by the user actuating a display control). If not, the process returns to step 
608 to continue polling for selection of a displayed hotUst by the user. If so, however, the process 
proceeds instead to step 614, which depicts closing the display of hotUst entries which satisfy the 
search criteria, and then to step 616, which illustrates the process becoming idle until another 
hotlist entry keyword search is received. 

[0047] The present invention provides an improved system of storage and retrieval for 
bookmarks in computerized information systems and grants the ability to locate relevant 
bookmarks based on the content or implications of the file represented by the bookmark. Still 
more particularly, the present invention displays of bookmarks related to a particular search term 
by use of a keyword data structure and a search and retrieval engine. It also provides a 
particularly distinctive facility for automatically extracting keyword data from the content of the 
web page and placing it in the bookmark file for later search. 

[0048] It is important to note that while the present invention has been described in the 
context of a fiilly functional data processing system and/or network, those skilled in the art will 
appreciate that the mechanism of the present invention is capable of being distributed in the form 
of a computer usable medium of instructions in a variety of forms, and that the present invention 
appHes equally regardless of the particular type of signal bearing medium used to actually carry 
out the distribution. Examples of computer usable mediums include: nonvolatile, hard-coded 
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type mediums such as read only memories (ROMs) or erasable, electrically programmable read 
only memories (EEPROMs), recordable type mediums such as floppy disks, hard disk drives and 
CD-ROMs, and transmission type mediums such as digital and analog communication links. 

[0049] While the invention has been particularly shown and described with reference to a 
preferred embodiment, it will be understood by those skilled in the art that various changes in 
form and detail may be made therein without departing from the spirit and scope of the 
invention. 
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